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Abstract 

Razborov and Rudich have shown that so-called natural proofs are not useful for separating P from NP unless hard 
pseudorandom number generators do not exist. This famous result is widely regarded as a serious barrier to proving 
strong lower bounds in circuit complexity theory. 

By definition, a natural combinatorial property satisfies two conditions, constructivity and largeness. Our main re- 
sult is that if the largeness condition is weakened slightly, then not only does the Razborov-Rudich proof break down, 
but such "almost-natural" (and useful) properties provably exist. Specifically, under the same pseudorandomness as- 
sumption that Razborov and Rudich make, a simple, explicit property that we call discrimination suffices to separate 
PI poly from NP; discrimination is nearly linear-time computable and almost large, having density 2"^'"^ where q is 
a quasi-polynomial function. (This is a slightly stronger result than the one announced in the FOCS 2008 extended 
abstract of this paper) For those who hope to separate P from NP using random function properties in some sense, 
discrimination is interesting, because it is constructive, yet may be thought of as a minor alteration of a property of a 
random function. 

The proof relies heavily on the self-defeating character of natural proofs. Our proof technique also yields an 
unconditional result, namely that there exist almost-large and useful properties that are constructive, if we are allowed 
to call non-uniform low-complexity classes "constructive." We note, though, that this unconditional result can also be 
proved by a more conventional counting argument. 

Finally, we give an alternative proof (communicated to us by Salil Vadhan at FOCS 2008) of one of our theorems, 
and we make some speculative remarks on the future prospects for proving strong circuit lower bounds. 
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1. Introduction 

In a famous paper |7], Razborov and Rudich introduced the concept of a natural combinatorial property of a 
Boolean function. They showed on the one hand that almost all lower bounds in circuit complexity theory proved up to 
that time (specifically, all non-relativizing, non-monotone, superlinear lower bounds) had employed natural properties, 
and on the other hand that natural properties cannot be used to separate P from NP unless 2"' -hard pseudorandom 
number generators do not exist. Their result is widely regarded as a serious barrier to proving strong circuit lower 
bounds. 

In more detail, if F and A are complexity classes, then Razborov and Rudich say that a property of Boolean func- 
tion on n variables is T-natural of density 6„ and useful against A if (roughly speaking) the property is F-computable 
(from the truth table of a given Boolean function), if it holds for 2^"S„ Boolean functions, and if it contains no A- 
computable Boolean functions. They showed that if F = A = PI poly and 6„ - Q(2"'^"''*"'), then no such properties 
exist unless 2"'^-hard pseudorandom number generators do not exist. Informally, if a property is constructive (F is 
sufliciently weak) and large (6„ is sufficiently large), then it is not likely to be useful for proving strong circuit lower 
bounds. 
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It follows that if we believe in hard pseudorandom number generators but still wish to prove circuit lower bounds, 
then we are led to ask just how non-constructive and/or small a property needs to be in order to circumvent the so-called 
"naturalization barrier." Rudich [9] has shown that if we allow ourselves to assume a stronger pseudorandomness hy- 
pothesis, then the naturalization barrier remains intact even if constructivity is weakened to A^P/^/joZy-constructivity. 
On the other hand, as pointed out by a referee of the FOCS 2008 extended abstract of this paper, for any fixed k there 
are properties computable in time 2" that are useful against circuits of size (simply use brute-force search). 

In this paper we investigate the weakening of the largeness condition. The main result is that under the same 
2"^-hard pseudorandomness assumption of the original Razborov-Rudich paper, we can explicitly exhibit a nearly- 
linear-natural property that separates NP from P/ poly and whose density is 2"'^*"' where ^ is a quasi-polynomial 
function (whose degree depends on e and on the size of the pseudorandom number generator). Of course, the pseu- 
dorandomness hypothesis trivially implies the existence of constructive properties that separate NP from P/ poly; for 
example, simply take an explicit family of A^P-complete Boolean functions. However, this latter family has density 
2-«(") for some function e(n) that grows exponentially; this is far smaller than 2"**"*. 

The main idea of our proof is to exploit the self-defeating nature of natural proofs. Assume that natural, useful 
properties do not exist (for example, by assuming that 2"' -hard pseudorandom number generators exist and invoking 
Razborov-Rudich). This means that every attempt to find a natural property that discriminates nonconstructive func- 
tions from constructive ones fails. The key observation is that a natural property is itself just a constructive function 
(a constructive function of a truth table, that is, but a truth table is just an arbitrary binary string). Therefore we have 
identified a feature that every constructive function has: It is no good at discriminating nonconstructive functions 
from constructive ones. So if we consider the property of discrimination, i.e., the ability to distinguish nonconstruc- 
tive functions from constructive ones, then discrimination is a useful property. On the other hand, it is easy to prove 
unconditionally that discrimination is almost large, and that discrimination is constructive!] Moreover, one can ex- 
pUcitly describe an NP function that is discriminating, so discrimination separates NP from Pj poly. This is our main 
result. 

The key point is that it is the very assumption that natural, useful properties do not exist that yields a useful 
property. 

One can ask whether the above line of reasoning can be used to prove an unconditional result, just as Avi Wigder- 
son adapted Razborov and Rudich's argument to prove unconditionally that there is no natural proof that the discrete 
logarithm problem is hard. Indeed, this is possible, as we show below. It turns out, however, that this unconditional 
result can be proved using a direct counting argument. 

We hope that these results will give some insight into how to bypass the naturalization barrier. If 2"' -hard pseu- 
dorandom number generators do not exist, then of course the naturalization barrier evaporates. On the other hand, 
if such generators do exist, then our results show that there exists at least one property (namely, discrimination) that 
separates NP from Pj poly and that is both constructive and — as we shall see shortly — only a minor alteration of a 
random property. 

2. Preliminaries 

We write N for the positive integers, and our logarithms are always base 2. All gates in our Boolean circuits are 
assumed to have just two inputs. We use the notation (x„) to denote a sequence x\,X2,..., and whenever we refer 
to a sequence (/„) of Boolean functions, we always understand that f„ is a function of n variables. Given a function 
/I : N ^ N, we write S IZE(A) to denote the complexity class comprising all sequences (/„) of Boolean functions for 
which there exists a constant c such that the minimum circuit size of /„ is at most cA{n) for all sufficiently large n. The 
following definition will also be convenient. 

Definition 1. Given two functions y : N — > N and /I : N — > N, we say that y outstrips A if for every constant c > 
there exists no such that y{n) > cA(n) for all n > «o- That is, y eventually grows strictly faster than any constant 
times A. 



Note that even if every discriminating function is non-constructive, the property of discrimination is constructive, because it is easy to check, 
given the truth table of a function /, whether / is a discriminating function. 
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Now let us review some fundamental concepts from f]^ . 

Definition 2. A Boolean function property (or just property for short) is a sequence C = (C„) where each C„ is a set 
of Boolean functions on n variables. 

Definition 3. If F is a complexity class and is a sequence of positive real numbers, then a property (C„) is 
Y-natural with density 6„ if 

1. (largeness) \C„\ > 2^"5„ for all sufficiently large n; and 

2. (constructivity) the problem of determining whether /„ e C„, given as input the full truth table of a Boolean 
function /„ on n variables, is computable in F. 

Note that our definition of natural differs sUghtly from that of Razborov and Rudich; for them, a natural property 
is one which contains a large and constructive property. This difference will do no harm, because our results assert 
the existence of certain natural properties in our sense, and a property that is natural in our sense is also natural in 
Razborov and Rudich's sense. 

Later on we will be particularly interested in the case of nearly-linear-natural properties, which we define to mean 
r - DTIME{N(logNy) for some constant c. Here we have used an uppercase to emphasize that "nearly linear" 
means nearly linear in = 2", the size of the truth table of /„. 

Next we recall the definition of a useful property. 

Definition 4. If A is a complexity class, then a property (C„) is useful against A if for every sequence (/„) of Boolean 
functions satisfying /j, E C„ for infinitely many n, (/„) i A. 

For our purposes we also need a slightly weaker notion, which we shall call quasi-usefulness. 

Definition 5. If A is a complexity class, then a property (C„) is quasi-useful against A if for every sequence (/„) of 
Boolean functions satisfying /„ e C„ for all sufficiently large n, (/„) i A. 

The difference between usefulness and quasi-usefulness is that there may be infinitely many n for which a quasi- 
useful property is easy to compute, whereas this cannot happen for a useful propertyjj However, a quasi-useful 
property retains the important characteristic of not containing any A-computable sequence of Boolean functions. So 
for the purpose of separating A from a higher complexity class, quasi-usefulness suffices. 

Note that the only reason we introduce quasi-usefulness is to handle the slightly annoying technicality that the 
length of a truth table is not an arbitrary integer but is always a power of two. An alternative way around this 
technicaUty might be to pad out strings whose lengths are not powers of two. 

Definition 6. Fix e > 0. A family of functions G„ : {0, 1)" — > {0, 1}^" is a 2"' -hard pseudorandom number generator 
if for every circuit C with fewer than 2"' gates, 

|Prob[C(G„(x)) = 1] - Prob[C(y) = 1]| < 1/2"'. 

Here x is chosen at random from {0, 1 )" and y is chosen at random from {0, 1 )^". 

We are now ready for Razborov and Rudich's fundamental result. We need a slightly stronger version of the 
theorem than the one that appears in their paper. 

Tlieorem 1 (Razborov-Rudich). Fix c > \, d > \, and e > 0. Assume that there exists a 2^^ -hard pseudorandom 
number generator Gk in S IZE(k^). Then for any e > l-i-cd/e, there is no P/ poly-natural property with density greater 
than 2~" that is useful against S IZE(n'^). 



^As pointed out by a referee, our distinction between useful and quasi-useful is the same as tlie distinction between diagonalization a.e. and 
diagonalization i.o. in tgj. 
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Proof. Only minor changes to Razborov and Rudich's argument are needed, but for completeness we give a full proof. 

Choose any e > 1 + cdje. We use our pseudorandom number generator G to construct a pseudorandom function 
generator /. For every k> 1 , let , : {0, 1 )^ ^ {0, 1 be the first and last k bits of Gk respectively. For the rest of 
the proof, we will write n for \}fl''l2\. For any A;-bit string x, let fix) be the Boolean function that sends y e {0, 1)" to 
the first bit of 

Gf oGf-' o...oG^>W. 

We claim that the family of functions [fix)} is in SIZEin"). This is because Gk is in SIZE(k^) c S IZE(n''^^), and it 
is straightforward to build a circuit for f{x) using n copies of G^ (with the ith bit of the input dictating which half of 
the ith copy of Gk to use). 

Now assume towards a contradiction that there exists a P/ poly-natmal property (C„) with density at least 2"" 
that is useful against SIZEin'^). Then for all sufficiently large k, none of the functions f{x) are in C„. Therefore if f„ 
denotes a randomly chosen Boolean function on n variables and x denotes a randomly chosen k-bii string, then 

|Prob[C„(/(x)) = 1] - Prob[C„(f„) = 1]| > l'"" . (1) 

Equation ([T]) gives us a statistical test for fix), which we now convert into a statistical test for Gk- Let T be a 
binary tree of height n, having 2" - 1 internal nodes and 2" leaves. Construct a labeling { of the nodes of T by labeling 
the leaves with (distinct) n-bit binary strings and labeling the internal nodes with (distinct) numbers 1 to 2" - 1 in such 
a way that if u and v are internal nodes and u is a child of v, then €{u) < {{v). If y is a leaf of T, then let €{y){ j) denote 
the jth bit of {{y). For / e {0, 1, . . . , 2" - 1), let T, be the subforest of T induced by the set of internal nodes v with 
({v) < i, together with all the leaves. If y is a leaf of T, then let Vi{y) be the root of the subtree of T, containing y, and 
let h(i,y) be the distance between v,(y) and y (so for example h{i,y) - Oif Vi(y) - y). 

Now define independent random variables x(v), one for each node v of T, and each chosen uniformly from {0, 1}*^. 
Define a random collection f,;„ by letting tijiiy) (for a leaf y of T) be the first bit of 

, (jmn-i) „ . . . „ Gf '<"-''<''>'>"''(x(v,-(3;))). 

Then fo,„ is f„ and f^'-i.n is /(x), so Equation ([T]i implies that for some /, 

|Prob[C„(f,_i,„)) = 1] - Prob[C„(f,>) = 1]| > 2-"72" > 2-2«", (2) 

since d> I. There must exist some assignment of the values of the x(v) for all roots v of subtrees of T, except the root 
u with i(u) - i, such that Equation (O still holds when conditioned on this assignment. By fixing such an assignment, 
we obtain a statistical test that distinguishes between Gic(x{u)) and (x{u'),x(u")), where u' and u" are the children 
of u, and that can be computed by circuits of size 2"^"' (because (C„) € P/ poly). But this contradicts the 2'^'-hardness 
of Gk, because for all sufficiently large k, is larger than In'' and also larger than any constant times n. □ 

Finally, we need some estimates for the size of i//{n,g), the number of Boolean functions of n variables that can 
be computed by Boolean circuits with at most g gates. The upper bound is due essentially to Shannon, though the 
version we quote here is Lemma 2.1 in jlOll . 

Proposition 1. For all n > 1 and g > 1, iff(n,g) < g^e^^'^". 

For the proof of TheoremHl we will also need a lower bound on ^{n,g). This result is somewhat technical and 
is not needed for the proofs of Theorem |2] or Theorem [3] so the reader can skip to Section [3] now without loss of 
continuity, returning to the lemmas below when needed. 

We need a couple of facts about binomial coefficients. These facts are well known to experts, but for completeness 
we give the proofs. The first fact is an elementary large-deviation result. 

Lemma 1. Ifk < {112 - e)N, then there is a constant c > (depending on e but not on N or k) such that 

§(:)"(:)■ 
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Proof. Let S denote the sum on the left-hand side of The ratio between consecutive terms in S is il(N - i + 1), 
and since / < A; < (1/2 - e)A^, it follows that 

^_ ^ (1/2 -.)A^ 
N-i+\ ~ (ll2 + e)N+\ 

The right-hand side of (|4]i is bounded by some constant strictly less than one. Therefore S is bounded by a convergent 
geometric series, and this proves the lemma. □ 

Lemma 2. Assume that k < N/2. If log < N/2, then k < N/4. 

As the proof below makes clear, Lemma |2] remains true if we replace ''N/2" by "(1 - e)N," provided we replace 
'W/4" by a suitable constant times and require that be sufficiently large. We do not need this extra generahty, so 
we have stated Lemma|2]with specific constants to make it easier to read. 

Proof. If A; = then the result is trivial, so assume that k ^ 0. Let H(x) :- -xlogx - (1 - x)log(l - x) be the 
entropy function. The basic reason why the lemma is true is that log (^'^^ x N ■ H(k/N). More precisely, by Stirling's 
approximation, 

log (^) . A. . Hik/N) . \ log ^ - 1 log 2. - . log . 

>_N.Hik/N).Uog^^-2. 



So iflog(^) < A^/2, then 



H{k/N) < i - — log — + —. 

^ ' ' 2 2N ^k{N-k) N 



The expression N/k(N - k) is minimized when k - N/2, and by elementary calculus we find that (l/2x) log(4/x) is 
minimized when x = 4e (remember that in this paper, our logarithms are base 2). Therefore, provided A^ > 10, 

1 log e 

H(k/N) <- + + 0.2 < 0.8. 

2 8e 

It follows that if A^ > 10, k/N < H-\0.8) < 1/4 as desired. If A^ < 10, then the lemma can be checked by direct 
computation. □ 

Now we are ready to prove a lower bound on i//(n, g). 

Proposition 2. Let y : N N be a function such that y(n) < 2""^/n and such that yin) outstrips n log n. Then for any 
fixed d, 4>{n, y{n)) > n''i//{n, y{n)/2)for all sufficiently large n. 

Proof. Let N - 2". We are trying to find a lower bound on how many more Boolean functions we can compute with 
y(n) gates than we can compute with only y{n)/2 gates. Our main observation is that by using 0{n) extra gates, we 
can change any single entry of the truth table of any given Boolean circuit: Simply use the 0{n) gates to test if the 
input equals a specific n-bit value, and flip the output of the circuit if it does. 

If B denotes the set of truth tables of functions computable with at most y(n)/2 gates, then our main observation 
implies that if we are allowed up to y{n)/2 + 0{n) gates, then at minimum we can also compute all the functions on 
the boundary G{B) of B, i.e., the truth tables whose Hamming distance from Z? is 1. We know very little about the 
structure of B, but we do have an estimate of its volume, so we can obtain a lower bound on the size of its boundary 
by appealing to a discrete isoperimetric inequality. In particular, it follows from standard result^ that if we choose k 



^See for example Bezrukov's survey paper Bezrukov states an isoperimetric inequality for the inner boundary r(B), but this can be converted 
into an inequality for G(B) as follows. In the notation of Bezrukov's paper, we may assume that B is an optimal set Lj^ for some m. Then the 
radius-(j(: + 1) Hamming ball 5f^j (0) c B u G(B), so if we let = |B U G(B)\, it follows that as long a.sk+\< N/2, 



\nBUG(B))\ > |r(Lf)| > |r(sf,,(0))| = (^^ J. 



On the other hand, r(S U G{B)) c G(B) so |r(B U G(B})\ < \G{B}\. 



so that 

k+l 



!=0 ^ ' /=() ^ ' 



then |G(fi)| > We claim that there is some constant c such that \B\ < c |G(Z?)| for all large «. To see this, note that 
since y(n) < N/4n, Proposition [T] implies that for large n, 

, . . N , N / N \, 2N , N N I log4n\ 

But (Is) yields the lower bound |B| > so by Lemma|2] k < N/A. This fact, together with the upper bound on \B\ 

given by (|5]), implies (by Lemma[T]i that \B\ is bounded by a constant times Since \G{B)\ > (^'^J, our claim is 

proved. 

So when an additional 0{n) gates are allowed, the number of computable functions is multiplied by at least some 
constant factor K > 1. Now in fact we have y{n)l2 additional gates at our disposal, and y{n)l2 outstrips nlogn, so 
the multiplicative factor is greater than for any constant c, and this eventually grows faster than for any 

fixed d. □ 



3. The main result 

Theorem 2. Assume that, for some e > 0, there exists a 2"' -hard pseudorandom number generator G in Pj poly. Then 
there exists a quasi-polynomial function q and a nearly-linear-natural property of density Q(2"**"') separating NP 
from Pj poly. 

In fact, as will be apparent from the proof, the property we exhibit contains functions that are probably not A^P- 
hard, so our separation is actually stronger than NP ^ PI poly. 

The main tool in our proof of Theorem|2]is the following concept. 

Definition 7. Given y : N — » N, we define a Boolean function / on n variables to be y-discriminating if either of the 
following two conditions holds: 

1 . n is not a power of 2. 

2. n - 2"' for some m and 

(a) fix) = 1 for at least 2"/n values of (the n-digit binary string) x, and 

(b) /(x) = if X is the truth table of a Boolean function on m variables that is computable by a Boolean circuit 
with at most y{m) gates. 

If we let Ml be the set of all y-discriminating Boolean functions on n variables, then (Mj) is a Boolean function 
property that we shall call y-discrimination. 

The following easy lemma shows that y-discrimination is constructive, and gives a lower bound on its density. 

Lemma 3. Let y : N — > N iie a time-constructible function satisfying y(m) < 2'"/t7ifor all m. Then y-discrimination 
is a nearly-linear-natural property with density Q(2~'''('°sn.r(iog«))) 

Proof. Let n denote the number of variables of our Boolean functions. If n is not a power of 2 then the lemma is 
trivial, so assume that n - 2"\ 

First we note that, since y(m) < 2"y»J, it is easy to deduce from Proposition [T| that the number of Boolean circuits 
with m inputs and at most yim) gates is much less than 2^" = 2". 

Let us check constructivity. To verify that a given truth table is the truth table of a y-discriminating function, we 
must check that the fraction of entries equal to 1 is at least 1 /n, and we must also check that the entries indexed by 
truth tables of functions computable by circuits with at most y{m) gates are 0. Let = 2" be the size of the truth 
table. Counting I's clearly takes time that is nearly linear in A^, but to check the forced O's we must first compute 
y(m), then run through each possible Boolean circuit in turn, computing its n truth table values, and checking that the 
corresponding entry of the given truth table is 0. Since y is time-constructible, computing y{m) takes time 0(2'"), so 
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evaluating y aim — log log takes time at most polylogarithmic in A^. Enumerating all the circuits is a straightforward 
process, and the total number of circuits to be enumerated is at most A^, so the entire computation takes time at most 
multiplied by some factors that are polylogarithmic in A^. 
It remains to estimate the density. If we were to ignore condition 2(a) in the definition of a y-discriminating func- 
tion, then we would simply be counting functions that must be in certain positions and are unrestricted otherwise, so 
the total number of functions on n variables would be precisely 2^" From this we can get a lower bound for 
the true number of y-discriminating functions by subtracting off the total number of Boolean functions on n variables 
whose truth tables have at most 2"/n entries equal to 1. This latter quantity is 

2"/« /^„> 

2 } 

By Lemma[T] 

where the second equaUty is a routine appUcation of Stirling's approximation. It follows that for some constant c, the 
number of y-discriminating functions is at least 

'2^'^-ijj{m,y(nt}) _ 2^(2" log «)/« 

_ 22"2^'^("'' _ 2c(2"logn)/n-2"+i/<(m, y(m))-j 

Again, i^(m, y(m)) is vanishingly small compared to 2^° = 2", so the density is indeed eventually lower-bounded by a 
constant times 2-^*("'' . □ 

We are now ready for the proof of our main result. 

Proof of Theorem^ By hypothesis, there exists c > 1 such that the pseudorandom number generator Gk is in 
SIZE{k'^). Choose some number e > 1 + c/e, and let y be the function y(m) = m''. Then we claim that the de- 
sired property is simply y-discrimination. 

By Lemma[3]we know that y-discrimination is nearly-linear-natural with density Q(2"'^'^'°8"'^('°s«))). Since y is a 
polynomial function, Proposition[T|implies that this density is indeed Q.{2^'^^"^) for some quasi-polynomial q. 

We next show that y-discrimination is quasi-useful against P/ poly. Given /„ e Mj,, define the property (C„0 by 
letting a function with truth table x be in Cm if and only if /2" (x) - 1. Since / is a y-discriminating function, it follows 
that (Cm) is useful against SIZEiirf) and that (C,„) has density Q(2""'). Invoking Theorem [T] with d - 1, we see 
that (C,„) cannot be Pj po/y-constructive. In other words, (/„) ^ Pj poly, which means that y-discrimination is indeed 
quasi-useful against Pj poly. 

Finally, let (/,) be the sequence of y-discriminating functions that are only when forced to be by condition 2(b) 
and that are 1 otherwise. Then (/„) is in NP, in the sense that the language L defined by 

X e L <^ fnix) = 

IS m NP^ The reason is that, for n a power of 2, a Boolean circuit with truth table jc is a certificate for membership in L, 
and such a circuit has size y(log n), which is polynomial (even polylogarithmic) in n, the size of x. This completes the 
proof. 

Note that as we remarked earlier, (/„) is almost certainly not A^P-complete, so that we have actually separated 
Pj poly from a subclass of NP. □ 



"^Some authors might prefer to say that (/„) is in co-NP, but since we could have chosen to interchange the roles of and 1 in the definition of 
y-discrimination, this distinction is of no importance. 
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4. An unconditional result 



As we remarked in the introduction, the idea behind the proof of Theorem |2] can be adapted to prove a non- 
uniform version of the resuh that has no unproven hypotheses. Now, it turns out that this unconditional result can also 
be proven by a counting argument that does not use any self-reference. Since the two arguments are very different in 
flavor, we present both of them below. 

First we need a non-uniform version of Lemma[3] 

Lemma 4. Let y ; N — > N fee a function satisfying y(m) < 2"ymfor all m. Then y-discrimination is a non-uniformly 
linear-natural property with density Q(2-'A0og«,r(iog«))). 

When we say "non-uniformly linear-natural property," we of course mean that membership can be decided by 
circuits whose size is linear in the size of the truth table. 

Proof. The proof is the same as the proof of Lemma[3]except when it comes to F-constructivity. 

Let n - 2'" denote the number of variables of our Boolean functions. As we said before, to verify that a given 
truth table is the truth table of a y-discriminating function, we must check that the fraction of entries equal to 1 is at 
least 1 In, and we must also check that the entries indexed by truth tables of functions computable by circuits with at 
most y{m) gates are 0. Let - 2" be the size of the truth table. We can count the number of 1 's using 0{N) gates, 
for example by using carry-save addition Also, for each n, the set of truth table entries that must be is fixed, so 
this condition can be checked using a number of gates that is proportional to the number of forced O's (even if y is not 
time-constructible); this number is certainly 0(N). □ 

Theorem 3. Let -y, /I : N — > N be functions such that y outstrips A and such that m < y{m) < 2^/mfor all m. Let 
F = S IZE(y) and let A — S IZE(A.). Then there exists a Y -natural property (Cn) with density Q(2"'**'°s"'')'('°g"W) that is 
quasi-useful against A. 

Proof. We argue by contradiction. Assume, as a reductio hypothesis, that there is no F-natural property (Cm) with den- 
sity Q(2"'''''°s"''''('°g'"W) that is quasi-useful against A. Then we claim that y-discrimination is quasi-useful against A. 

To see this, pick an arbitrary sequence of functions /„ € Mj. Define a property (C,„) by letting a function of 
m variables with truth table x be in C,„ if and only if f2'«(x) - I. Then by condition 2(a) in the definition of a y- 
discriminating function, (C,„) has density f2(2"'"). By assumption, y(logm) > logm, and it is easy to see that there 
are more than m distinct Boolean functions computable with log m gates and log m inputs, so the density of (C„,) is 
Q^^2-i//(iogm,y(iogm))y gy condition 2(b), if gm 6 C„, is any sequence of Boolean functions, then the minimum circuit 
size of gi„ exceeds y(m), and hence (g,,,) i A since y outstrips \. In other words, (C,„) is quasi-useful (in fact, useful) 
against A. Therefore, by our reductio hypothesis, membership in (C,„) is not F-computable. It follows that (/„) i F, 
and a fortiori (/„) ^ A. Therefore (/„) is quasi-useful against A, as claimed. 

But since n < y(n) < 2"ln, Lemma g] tells us that {Mf^) is F-natural with density Q(2-'Aaog".r(iog"))). Combined 
with the quasi-usefulness against A that we just proved, this fact contradicts our reductio hypothesis, so the theorem 
is proved. □ 

Observe that a curious feature of the above proof is that it is highly ineffective. The natural property whose 
existence is asserted is not explicitly exhibited, nor can an explicit example be extracted from the proof, which is 
intrinsically a proof by contradiction. Note also that a 5/Z£'(y)-natural property is not necessarily "constructive" in 
the intuitive sense even if y is polynomial, because SIZE(y) is a non-uniform complexity class. Nevertheless, we feel 
that Theorem[3]remains of some interest because it is an unconditional result. 

Next we present the promised counting argument, which in fact yields a stronger result than Theorem[3j 

Theorem 4. Let y, A : N N be functions such that A(n) = Q(nlogn), y outstrips A, and y(n) < 2"^^/nfor all n. 
Let A - SIZE(A). Then there exists a non-uniformly linear-natural property with density at least l/i/f(n, y(n)) that is 
useful against A. 

Proof. Let us first give a somewhat informal proof that conveys the essential idea. Let N -2". 

As usual, think of Boolean functions on n variables as represented by their truth tables. Let G„ be the set of 
Boolean functions on n variables computable by circuits of size y(n)/2. For each g e G„, imagine a Hamming ball of 
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volume 2'^l4i{n, j{n)) centered at g (by a Hamming ball centered at g we mean the set of all Boolean functions within 
a certain Hamming distance from g). There are i^(n, y(n)/2) < t//{n, jin)) such balls, so the total volume of these balls 
is less than 2^ . Therefore there must exist a function f„ outside all of these balls. It follows that there is a Hamming 
ball B„ of volume 2^/tfr{n, y(n)) around /„ that is disjoint from G„. Then since y outstrips A, {B„) is a property that is 
useful against A. Its density is 1/tpin, yin)). Moreover, testing for membership in B„ amounts to computing Hamming 
distance from /„, which can be done with circuits of linear size. 

This completes the informal proof. The only point that is not entirely rigorous is the assumption that there exists 
a Hamming ball whose volume is exactly 2'^ I if/in, yin)); this may not be true because the volume of a Hamming ball 
is necessarily a sum of consecutive binomial coefficients. For a rigorous argument, we choose our Hamming balls to 
have radius r, where r is chosen so that 

where we have abbreviated i/r(n, y(n)) to i/f,, to ease notation. Then the property of being in B„ certainly has density 
at least l/ij/,,, so all that needs to be checked is that t//{n,y{n)/2) such Hamming balls have total volume strictly less 
than 2^, i.e., that 

<A(n,r(«)/2)^(^)<2'^- (7) 

To prove this, observe that we just need to show that the ratio {Yj'i=Q{^i^j{Yj'iZo is bounded by a polynomial 
function of n, because then (|7]i will follow from ^ and Proposition |2] Now 

So we are reduced to showing that (A^ + l)/r is bounded by a polynomial function of n. To prove this, remember that 
by assumption y{n) < N/n, so Proposition[T]implies that if/„ < (N/n)^^"e^^"^'^". Taking logarithms and dividing by A^, 
we deduce that 

log(/r„ 1 /I 4«\ logn /I 4«\ 

< - log — + - + — log e = 1 + ~ + — log e. 



n n \n N j n \n N 

The (log n)/n term in this expression dominates, so for large n, 



logiA„ logn 



On the other hand, from Q we have 

2^ ^ lN\ ^ ,■ N'^^ - 1 



<N' 



r+l 



Taking logarithms, we get A^ - log ilin <{r + \) log A^, which combined with (O implies that for large n, 

r+l ^ 1 -(\og^„) IN ^ logn 
A^ " logA^ " 2n2 ■ 

We are now done, because N/(r + 1) and (A^ + l)/r are within a constant factor of each other. □ 



5. Vadhan's variation 

After my talk at FOCS 2008, Salil Vadhan showed me a different way to prove the main result of this paper, 
assuming only that SAT is hard (and therefore, of course, separating P I poly from NP but not from a subclass of NP). 
With his kind permission, I include his argument here. 
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Theorem 5. Assume that SAT is not computable by circuits of size 2" . Let y be a function such that j{n) outstrips 
(logn)'^"^. and let q(n) — 2'''"^ Then there exists a sublinear-natural property of density 2"''*"' that separates NP from 
PI poly. 

Proof. To ease notation, let m - yin). Fix some way of encoding SAT instances as binary strings. Let C„ comprise 
all Boolean functions f of n variables with the following property. If the last n-m bits of x are all zero, then f{x) is 1 
or according to whether or not the first m bits of x encode a satisfiable instance of SAT. (If any of the last n-m bits 
of X are nonzero, then f(x) can be anything.) Then C„ has density 1/2^ ' - 2"'*"\ By our assumption on the hardness 
of SAT, functions in C„ cannot be computed by circuits of size 2™'. Since m'^ grows faster than li log n for any fixed d, 
this shows that (C„) is useful against PI poly. Checking membership in C„ can be done in time poly(m) ■ 2™, which is 
certainly subhnear in 2". □ 

6. Final remarks 

It is natural to ask if our results give any new hope for proving strong circuit lower bounds|f| It is probably 
difficult to prove unconditionally that, say, n'°=" -discrimination is useful against a strong complexity class A, not only 
because that would separate NP from A, but also because y-discrimination is closely related to the circuit minimization 
problem, whose complexity is known to be difficult to get a handle on; see 

However, even as a potential candidate for an almost-natural proof of NP ^ PI poly, y-discrimination has an 
illuminating feature. Namely, the only thing that prevents a y-discriminating function from looking like a random 
function is the presence of certain forced O's in the truth table. Moreover, the proportion of forced O's goes to zero 
fairly rapidly as n goes to infinity. This illustrates the fact that largeness can be destroyed by imposing what seems 
intuitively to be a relatively small amount of "structure" on a random function. Therefore, the intuition that there is 
some constructive property of random functions that suffices to prove strong circuit lower bounds is not completely 
destroyed by the Razborov-Rudich results; a minor alteration of a random property may still work. 

It is also worth noting that existing circuit lower bound proofs might still be mined for ideas to break the natural- 
ization barrier Some linear lower bounds, such as those of Blum 1 3] and Lachish and Raz |5^, do not relativize and are 
not known to naturalize. Even proofs that are known to naturalize are not necessarily devoid of useful ideas. For ex- 
ample, in the course of analyzing a proof by Smolensky, Razborov and Rudich identify three properties Ci c C2 £ C3 
that are implicit in the proof, and show that C2, and a fortiori C3, are natural. However, Ci is constructive but not 
known to be large, so it is conceivable (though admittedly unlikely) that Ci is only almost large and is actually useful. 
Of course, one would still need to identify and use some feature of Ci that is not shared by C2 in order to prove a 
stronger circuit lower bound than Smolensky's, but the point is that the usefulness of Ci is not automatically ruled out 
by the fact that Smolensky's argument naturalizes. In theory, it could still be fruitful to study Ci . 

Finally, recall that as evidence that largeness is hard to circumvent, Razborov and Rudich showed that any formal 
complexity measure automatically yields a large property. Knowing that almost-natural proofs exist, we could perhaps 
try to come up with something that is almost, but not quite, a formal complexity measure. Unfortunately, as of now, 
this tempting idea remains purely speculative. 
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